TANGO: Bilingual Collocational Concordancer
Abstract
In this paper, we describe TANGO, a bilingual collocational concordancer. The system is designed to answer users' queries about bilingual collocational usage for nouns, verbs, and adjectives. We first obtained collocations from the large monolingual British National Corpus (BNC). We then identified collocation instances and their translation counterparts in a bilingual corpus such as the Sinorama Parallel Corpus (SPC) by applying a word-alignment technique. The main goal of the concordancer is to give users a reference tool for correct collocation use, helping second language learners acquire the most prominent characteristic of native-like writing.
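The first step described in the abstract, obtaining collocation candidates from a monolingual corpus, can be illustrated with a minimal sketch. The abstract does not say which association measure or window TANGO uses, so the code below assumes adjacent word pairs scored by pointwise mutual information (PMI); the function name `extract_collocations`, the `min_count` threshold, and the toy corpus are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def extract_collocations(sentences, min_count=2):
    """Rank adjacent word pairs by pointwise mutual information (PMI).

    Assumption: PMI over adjacent pairs stands in for whatever
    association measure the original system actually uses.
    """
    unigrams = Counter()
    bigrams = Counter()
    for tokens in sentences:
        unigrams.update(tokens)
        # Pair each token with its right-hand neighbour.
        bigrams.update(zip(tokens, tokens[1:]))

    total_unigrams = sum(unigrams.values())
    total_bigrams = sum(bigrams.values())

    scored = []
    for (w1, w2), count in bigrams.items():
        if count < min_count:
            continue  # drop rare pairs, whose PMI estimates are unreliable
        p_pair = count / total_bigrams
        p_w1 = unigrams[w1] / total_unigrams
        p_w2 = unigrams[w2] / total_unigrams
        scored.append(((w1, w2), math.log2(p_pair / (p_w1 * p_w2))))

    # Highest-scoring pairs first.
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical toy corpus, already tokenized.
toy_corpus = [
    ["heavy", "rain", "fell"],
    ["heavy", "rain", "stopped"],
    ["the", "rain", "fell"],
]
candidates = extract_collocations(toy_corpus, min_count=2)
```

In a real setting the candidates would typically be filtered further by part of speech (for example, verb-noun or adjective-noun pairs) before being looked up in the aligned bilingual corpus; that filtering is omitted here.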
Similar Resources
Computer Assisted Language Learning Based on Corpora and Natural Language Processing: The Experience of Project CANDLE
This paper describes Project CANDLE, an ongoing 3-year project which uses various corpora and NLP technologies to construct an online English learning environment for learners in Taiwan. This report focuses on the interim results obtained in the first eighteen months. First, an English-Chinese parallel corpus, Sinorama, was used as the main course material for reading, writing, and culture-base...
Online Verb-Noun Collocation Instruction with the Support of a Bilingual Concordancer
Studies on language learning have revealed that collocation knowledge is an important indicator for advanced second language competence. Native speakers' production relies heavily on prefabricated or pre-made chunks such as collocations, which save processing load to achieve language fluency and accuracy. Previous investigation on ESL and EFL students indicated that nonnative learners' collocat...
TS3: an Improved Version of the Bilingual Concordancer TransSearch
Computer Assisted Translation tools remain the preferred solution of human translators when publication quality is of concern. In this paper, we present our ongoing efforts conducted within TS3, a project which aims at improving the commercial bilingual concordancer TransSearch. The core technology of this Web-based service mainly relies on sentence-level alignment. In this study, we discuss an...
Enhancing the Bilingual Concordancer TransSearch with Word-Level Alignment
Despite the impressive amount of recent studies devoted to improving the state of the art of Machine Translation (MT), Computer Assisted Translation (CAT) tools remain the preferred solution of human translators when publication quality is of concern. In this paper, we present our perspectives on improving the commercial bilingual concordancer TransSearch, a Web-based service whose core technol...
FidaPLUS corpus of Slovenian
The paper describes the FidaPLUS corpus, which is an upgrade of the Slovenian reference corpus. The corpus has been improved on various levels: size, up-to-dateness, quality of linguistic annotation (lemmatization, POS-tagging), availability and user-friendliness of the on-line concordancer. It has also been implemented in the Sketch Engine software which produces one-page automatic, corpus-based...